A bioinformatics approach to elucidate conserved genes and pathways in C. elegans as an animal model for cardiovascular research

Cardiovascular disease (CVD) is a collective term for disorders of the heart and blood vessels. The molecular events and biochemical pathways associated with CVD are difficult to study in clinical settings on patients and in vitro conditions. Animal models play a pivotal and indispensable role in CVD research. Caenorhabditis elegans, a nematode species, has emerged as a prominent experimental organism widely utilized in various biomedical research fields. However, the specific number of CVD-related genes and pathways within the C. elegans genome remains undisclosed to date, limiting its in-depth utilization for investigations. In the present study, we conducted a comprehensive analysis of genes and pathways related to CVD within the genomes of humans and C. elegans through a systematic bioinformatic approach. A total of 1113 genes in C. elegans orthologous to the most significant CVD-related genes in humans were identified, and the GO terms and pathways were compared to study the pathways that are conserved between the two species. In order to infer the functions of CVD-related orthologous genes in C. elegans, a PPI network was constructed. Orthologous gene PPI network analysis results reveal the hubs and important KRs: pmk-1, daf-21, gpb-1, crh-1, enpl-1, eef-1G, acdh-8, hif-1, pmk-2, and aha-1 in C. elegans. Modules were identified for determining the role of the orthologous genes at various levels in the created network. We also identified 9 commonly enriched pathways between humans and C. elegans linked with CVDs that include autophagy (animal), the ErbB signaling pathway, the FoxO signaling pathway, the MAPK signaling pathway, ABC transporters, the biosynthesis of unsaturated fatty acids, fatty acid metabolism, glutathione metabolism, and metabolic pathways. This study provides the first systematic genomic approach to explore the CVD-associated genes and pathways that are present in C. elegans, supporting the use of C. elegans as a prominent animal model organism for cardiovascular diseases.


Selection of cardiovascular disease genes in human
There are several disorders that fall under the category of CVD.We selected the five most prevalent cardiac pathophysiologies for our analysis based on their significant contributions to CVD-related mortality which are as follows: cardiomyopathy, heart failure (HF), coronary artery disease (CAD), myocardial infarction (MI) and acute coronary syndrome (ACS) 11,41 .For obtaining CVD-associated genes in humans, we studied the GEO microarray datasets from the National Centre for Biotechnology Information (NCBI).Six datasets GSE42955, GSE57338, GSE18612, GSE64554, GSE66360, and GSE60993 belonging to cardiomyopathy, HF, CAD, MI, and ACS were selected for the study.Basic information on GEO Datasets used in the study is given in Table 1.The gene expression scores for the selected datasets were obtained by comparing data from patients with diseases and healthy individuals using the GEO2R tool which is an online analytical tool with an inbuilt R programme (https:// www.ncbi.nlm.nih.gov/ geo/ geo2r/) 42 .All the datasets were normalized and the statistical threshold of the p value was set as ≤ 0.05.Lastly, Log fold change was used as the primary index for DEG screening.In this investigation, Log fold change |0.25-1.5|was used as a DEG screening condition.The obtained up and downregulated genes of CVD were further used for pathway analysis and construction of the PPI Network.

Human CVD PPI network construction and hub gene identification
To understand the regulatory function of the genes, the protein-protein interactions (PPI) network has been constructed.The human CVD PPI network was constructed using the STRING database (https:// string-db.org/) 43 .The physical interaction data of STRING was then used for network construction, visualisation, and analysis through Cytoscape 3.6.0platform 44 .Finally, a PPI network of 1099 upregulated and 815 downregulated genes was constructed on the Cytoscape after the removal of isolated nodes and duplicate edges.The network's topological properties were analyzed using Network Analyser 45 and Cytohubba 46 plugin of Cytoscape.www.nature.com/scientificreports/ The most influential nodes of the network are considered hubs of the network.The highest degree nodes also having higher centrality values were identified as the hubs of the CVD PPI network.For identification of the key regulators (KRs) of the PPI network, first the network's most important genes according to their degrees (k) and the centrality measures (Cc, C B ) were identified.The top 20 genes of different topological properties were compared to identify KR genes 47,48 .

Identification of CVD-related orthologous genes in C. elegans
In order to investigate how many potential CVD-related genes and pathways are present in the C. elegans genome, the orthologous genes of most significant CVD-associated genes were identified.This was achieved by identifying the first Neighbours of KRs of the human CVD PPI network and finding its orthologous in C. elegans.The first neighbour nodes of KR genes in the human interactome are defined as proteins directly and physically interacting with KR proteins 49 .First neighbours play the same important roles in numerous PPI networks and signalling pathways as the associated KR proteins 49 .Therefore, for identifying orthologous genes of the most significant human CVD-related genes, the first neighbours of the essential key regulator genes were identified and extracted from the human CVD PPI Network.The identified 648 first neighbour genes that were highly significant were used for the identification of orthologous genes present in C. elegans.g: Profiler that allows mapping of orthologous genes across species has been a popular tool for identification of C. elegans-human orthologs 50 .In addition to g: Profiler, we utilized the OrthoList 2 database for identification of orthologous genes.OrthoList 2 (OL2) is a compendium of C. elegans-human orthologs compiled by a meta-analysis in 2018 51 .The HGNC-approved symbols of CVD genes of human is queried in g: Orth tool of g: Profiler (https:// biit.cs.ut.ee/ gprofi ler/ orth) and OrthoList 2 to obtain the C. elegans genes.

Orthologous gene PPI network construction and analysis
Animal models are employed to study the roles of specific genes or proteins associated with a disease with the speculation that the roles may be similar in humans.Comparison of human and model organism interactome can suggest that the proteins under study might play similar roles in human disease if their interaction is highly conserved 52 .Therefore, we constructed the PPI network of CVD-related C. elegans orthologous genes.For this, we utilized the STRING database (https:// string-db.org/) 43 .First, an interactome network of orthologous genes was constructed on STRING.The physical interaction data of STRING was then used for network construction, visualization, and analysis through the Cytoscape 3.6.0platform 44 .Finally, a PPI network of 581 nodes with 2519 connections (edges) was constructed on the Cytoscape after the removal of isolated nodes and duplicate edges.
Defining the fundamental topology of a network is generally done by analyzing its topological parameters, especially P(k), C(k), and C N (k) 53 .So, first, the fundamental topology of the orthologous gene network was defined by analyzing the above three parameters using the Network Analyzer app 45 in Cytoscape 3.6.0taking the network as an undirected network.The three parameters P(k), C(k), and C N (k) of the network were analyzed by plotting against the degree k, and their distribution behaviour was analyzed to define the topologies of the orthologous gene network.The centrality measures which include Closeness centrality (Cc) and betweenness centrality (CB) of every node in the network were analyzed, using the Network Analyzer app and CytoHubba app 46 in Cytoscape 3.6.0 to study the influence and control capabilities of the nodes in signal processing and their dominance in network integration.

Identification of essential key regulators of orthologous gene network
For identification of the key regulators of the PPI network, first, the network's most important genes according to their degrees (k) and the centrality measures (Cc, C B ) were identified.Accordingly, the highest degree nodes also had higher centrality values, so, they were identified as the most influential nodes and considered as hubs of the network.Therefore, the key regulators having the most significant roles in PPI network integration and stability would be among these most influential nodes and hubs in the network.
Hence, a group of top highest degree nodes, or hubs, that had the largest impact on the network from the level of the core network up to the level of motifs, were used to identify the network's important regulators.The hubs functioning as the backbone of the system-level structure and represented at every hierarchical level were found by manual hub tracing and were regarded to be significant regulators.

Analysis of modules
Modules of large PPI networks are defined as the set of statistics and functionally significant interacting genes.We constructed modules, using MCODE (Molecular Complex Detection) version 1.5.1 which follows the principle that highly connected regions (or clusters) of interaction networks are often complexes 54 .MCODE, identifies the clusters that are highly interconnected regions in a network.The default MCODE parameters i.e., "Degree cut-off = 2, " "node score cut-off = 0.2, " "k-score = 2, " and "max.depth = 100." were used for network scoring and cluster finding.Significant modules were identified and filtered out on the basis of MCODE score.

Comparative pathway analysis of human and C. elegans
Pathway-based analysis is critical for understanding the interaction of diseases at the molecular level.DAVID (https:// david.ncifc rf.gov/) which is a comprehensive set of functional annotation tool was used to identify GO terms and Pathways 55 .CVD-related DEG in case of human and C. elegans orthologous genes were used to perform gene enrichment analysis.Comparison of pathways and GO terms of human CVD-related genes and C. elegans orthologous genes was done to identify common pathways.

Identification and selection of CVD-related genes in humans
For gaining insights into the genes and pathways of CVD in model organism C elegans , six datasets belonging to cardiomyopathy, HF, CAD, MI and ACS were selected on the basis of high prevalence (Prevalence-CAD and ACS: 30%, MI: 5-30% for different age group, HF: 15-25% relative to age, cardiomyopathy: 0.2-0.5%)and its contribution to CVD-related mortality (Annual mortality rate-CAD and ACS: 5-6% per 1,00,000 population, MI: 34-42%, HF: 15%, Cardiomyopathy-6 to 8%) 11,41 .The details of all datasets used in the study are given in Table 1.The selected datasets were analyzed using GEO2R.For the screening of DEGs, the statistical threshold of p value was set as ≤ 0.05 and Log fold change was used as the primary index for filtering out the genes.Common DEGs shared by two datasets were identified and considered only once in the final gene list by removing duplicates.Finally, a total of 1099 upregulated and 815 downregulated genes associated with human CVD were retrieved by analyzing the selected datasets (Supplementary file 1).The obtained DEGs were used for further study.

Human CVD-related gene network construction and hub gene identification
PPI network refers to the biochemical and biological activities of proteins in cells.It adds considerably to our understanding of cell physiology and disease association in the fields of biology and bioinformatics 4 .Therefore, a PPI network was constructed using 1099 upregulated and 815 downregulated genes identified to be associated with human CVD.In the PPI network, proteins are denoted by nodes and the association between the proteins is given by undirected edges.The results of network analysis reveal that the CVD PPI network comprises 1626 interacting nodes and 16,903 edges (Supplementary Fig. 1).Topological properties define the structural properties of a complex biological network 56 .Therefore, to understand the essential behaviour of the CVD PPI network we studied the topological properties using the Network Analyzer app and CytoHubba app in Cytoscape.The "hubs" of the network are defined as the nodes with the highest degree in the system and tend to have an essential role in the PPI network.After analysis, we identified the top 10 most important hub genes (IL6, IL1B, PTPRC, MYC, FN1, ITGAM, TLR4, STAT3, JUN, and CXCL8) from the CVD PPI network as per the decreasing value.Similarly, analysis of topological properties of the CVD PPI Network resulted in the top 10 centrality measures (Betweenness and Closeness centrality) as listed in Table 2.
KR genes in a PPI network refer to a subset of genes that play pivotal roles in orchestrating and controlling various cellular processes through their interactions with other proteins.These genes exert significant influence over the network's dynamics and functionality, often acting as central hubs or master regulators 57 .Therefore, for the identification of essential KRs serving as the backbone of the CVD PPI network, the top 20 genes from three distinct topological features (degree, betweenness, and closeness centrality) were compared (Fig. 2A-C).
Eleven key regulators of the Human CVD that were common for all three attributes (Fig. 2D) were identified as follows: JUN, HSP90AA1, TLR4, CXCL8, MAPK3, PTPRC, IL6, IL1B, STAT3, MYC, and FN1 indicating that these genes might play important regulatory functions in CVD development and progression in human.Seven out of eleven KRs were found to be upregulated with IL1B having maximum fold change (2.87) while four genes were downregulated with HSP90AA1 having fold change of − 1.21 (Fig. 2E).The expression of KRs in control and disease groups is presented by a box plot in Fig. 3.The expression of IL1B, JUN, FN1, CXCL8, TLR4, PTPRC, and MAPK3 was significantly higher in diseased condition than in control while the expression of HSP90AA1, IL6, STAT3, and MYC was found to be lower in diseased than control.

Orthologous genes of human CVD in C. elegans
Completion of the C. elegans genome sequence in 1998 18 demonstrated that roughly 38% of worm genes have a human ortholog 58 .The percentage of human disease-related genes that have at least minor similarity (E < 10 −10 on BLASTP searches) with C. elegans genes ranges between 40 and 75% [19][20][21][22][23][24] .For identifying orthologous genes of most significant human CVD-related genes, the first neighbours of the essential key regulator genes were identified and extracted from the human CVD PPI Network.The first neighbour nodes of KR genes in the PPI network are the proteins that are directly and physically interacting with KR proteins 49 .Therefore, we selected the www.nature.com/scientificreports/first neighbour genes for C. elegans orthologue identification as they have similarly important roles in numerous PPI networks and signaling pathways as the associated KR proteins.A total of 637 first neighbour genes were found to be associated with the 11 KR proteins in the human CVD PPI network.Finally, 1113 C. elegans genes that were orthologous to 648 human CVD-related genes were identified (Supplementary file 2).The identified orthologous genes were used for the construction PPI network and comparative pathway study.

Orthologous gene network construction and analysis:
Many genes of different diseases in the genome of C. elegans have been extensively studied and their functions are well characterized.By studying the PPI network of CVD-related orthologous genes in C. elegans, we can infer the functions.Therefore, a PPI Network was constructed through STRING using the total C. elegans orthologous genes (Supplementary Fig. 2).In the PPI network, proteins are represented by nodes, and the relationship between proteins is shown by undirected edges.Further, to understand the structural properties of the PPI network based on network topology and connectivity patterns, Orthologous gene network analysis was done using the Network Analyzer app and the Cytohubba plugin of Cytoscape.The "hubs" of the network are defined as the nodes with the highest degree in the system.After analysis, we identified the top 10 most important hub genes (aha-1, daf-21, pmk-1, gpb-1, pmk-2, cyb-2.1,cyb-2.2,hif-1, eef-1G and cyb-1) from the PPI network.Similarly, analysis of topological properties of the CVD PPI Network resulted in the top 10 centrality measures (Betweenness and Closeness centrality) as listed in Table 3. Figure 4A-D shows the interaction of top 10 different Topological properties of orthologous gene network.
For the identification of essential key regulators of the CVD orthologous gene PPI network, the top 20 genes from four distinct topological features (degree, bottleneck, betweenness and closeness centrality) were compared (Fig. 4E).Ten genes: pmk-1, daf-21, gpb-1, crh-1, enpl-1, eef-1G, acdh-8, hif-1, pmk-2 and aha-1 were found to be common for all the four attributes.When comparison of the identified key regulator nodes of human and C. elegans was done, it was found that two orthologous genes (pmk-1 and pmk-2) of MAPK3 which is an important key regulator of human CVD is serving as key regulators in C. elegans PPI network as well.sta-1 which

Analysis of modules in orthologous network
Modules represent groups of proteins that tend to interact with each other more frequently than with proteins outside the module 59 .Modules can help us understand the dynamics of PPI networks.They may be transiently active in response to specific cellular conditions or environmental cues.Investigating module dynamics can reveal how the network adapts to changing circumstances.Hence, for determining the role of the orthologous genes in C elegans at various levels in the created network, the native or parent network was then divided into subnetworks or modules using the MCODE plug-in of the Cytoscape programme.The default MCODE parameters i.e., "Degree cut-off = 2, " "node score cut-off = 0.2, " "k-score = 2, " and "max.depth = 100." were used for network scoring and cluster finding.A total of 25 modules were identified, out of which 9 modules having MCODE score ≥ 5 were considered significant and filtered from the PPI network.Interestingly, the identified key regulators were found to be part of these modules (Fig. 5).Among the top 4 modules of Cardiovascular disease network, Module 1 has 24 nodes and 276 edges with a score of 24 (Fig. 5A) ; Module 2 has 49 nodes and 361 edges with a score of 15.042 and contains four regulatory gene (pmk-1,pmk-2,gpb-1,eef-1G) (Fig. 5B); Module 3 has 17 nodes and 114 edges with a score of 14.250 and has one key regulator genes acdh-8 (Fig. 5C) and Module 4 has 8 nodes and 25 edges with a score of 7.143 (Fig. 5D).

Comparative pathway analysis of human and C. elegans
In order to dissect the molecular mechanism of CVD in humans using C. elegans as an animal model, it is imperative to study the pathways that are conserved between the two species.In this study, we used the DAVID database to identify GO terms and KEGG pathways of the human CVD-related genes (Supplementary files 3 and 4) and its orthologous genes in C. elegans (Supplementary file 5).A comparison of pathways and GO terms of humans and C. elegans was done to identify CVD-related pathways that are present in C. elegans.The pathway and GO analysis of 1099 upregulated and 815 downregulated human genes and 1113 C. elegans orthologous genes showed that they are highly enriched in similar pathways.There are 9 commonly enriched pathways between humans and C. elegans that include Autophagy-animal, ErbB signaling pathway, FoxO signaling pathway, MAPK signalling pathway, ABC transporters, Biosynthesis of unsaturated fatty acids, Fatty acid metabolism, Glutathione metabolism, and Metabolic pathways.The list of pathways along with corresponding P-values in C. elegans and Humans are given in Fig. 6A.The total number of CVD-related genes and orthologous genes that are involved in these pathways are given by bar plot in Fig. 6B.The graph demonstrates that CVD-related genes and their orthologous are involved in metabolic pathways.Further, GO functional enrichment analysis also predicted similar functions in both the protein sets of human and C elegans.GO functions were classified into three categories.In each of these categories, several common functions were identified.We found 20 common Biological Processes describing the biological purpose of the gene or a gene product for carrying out a molecular function.Similarly, we also discovered 9 common Cellular Component and 13 common Molecular Functions through the comparative study of GO terms.Several important BP such as MAPK cascade, carnitine metabolic process, cell differentiation, chaperone-mediated protein folding requiring cofactor, fatty acid metabolic pathway, glutathione metabolic process, etc. are common among C. elegans and humans.The bubble plot in Fig. 7 illustrates the GO terms and KEGG pathways for C. elegans orthologous genes.The red colour highlights the enrichment of similar GO terms and Pathways in humans and C. elegans.As a result of common pathways and GO terms present between humans and C. elegans we speculate that C. elegans could serve as an animal model for studying cardiovascular diseases.

Discussion
With the rise in global incidences of CVD, it is becoming important to understand the CVD-related genes and pathways for gaining insights into the molecular and biochemical mechanism of the disease in order to develop efficient treatment strategies or preventative measures and lower the rate of incidence and morbidity.Various  www.nature.com/scientificreports/mammalian animal models are often employed to study CVD.In spite of having genomic relatedness, mammalian animal model systems have their own limitation such as high experimental cost, ethical issues, limited genomic tools available, complicated operation, and high reproductive environment demands 12 .A simple well-known model organism such as C. elegans whose genome has been completely sequenced and thoroughly studied 18 can give a comprehensive genomic evaluation of a wide range of human diseases including CVD 39,[60][61][62] .However, to date, we lack information regarding the exact number of CVD-related genes and pathways within the C. elegans genome.Likewise, the extent of genomic and functional similarities between C. elegans and humans in relation to CVD remains unknown, which restricts its comprehensive use as an animal model for cardiovascular disease research.
Since it is crucial to recognize the potential of the model organism as a valuable candidate for studying CVD, we have used a bioinformatics approach to compare the genes and pathways related to CVD of humans with those in C. elegans.We analyzed microarray datasets of patients to obtain CVD-related genes.A total of 1099 upregulated and 815 downregulated human CVD-associated genes were identified from gene expression analysis.In order to understand the cell physiology and disease connection of these DEGs, we constructed a PPI network.The network constructed from DEG shows hierarchical features, which means that the network has a system-level organisation involving modules which are interrelated.Since the network is hierarchical, their synchronisation exhibits various important functional regulations of the network.Significant genes (leading hubs) were recognized as key regulators of the network by influencing motifs and module regulation, indicating their biological significance.The leading hubs have significantly important functions.They integrate the lower degree nodes for organizing and regulating activities like inter and intra-cross-talk among other essential genes, maintaining network properties and stability, and optimizing the network signal processing 63,64 .
After comprehensive network analysis using the Network Analyzer app and Cytohubba plugin of Cytoscape, we identified the eleven proteins as central or key regulators within the human CVD network which are as follows: JUN, HSP90AA1, TLR4, CXCL8, MAPK3, PTPRC, IL6, IL1B, STAT3, MYC and FN1.Our results demonstrate that the expression of IL1B, JUN, FN1, CXCL8, TLR4, PTPRC, and MAPK3 was significantly higher in the diseased condition than control while the expression of HSP90AA1, IL6, STAT3, and MYC was found to be lower in diseased than control (Figs.2B and 3).There is increasing evidence suggesting that inflammation plays a role in the development of CVD.IL1B is a pro-inflammatory cytokine 65 .Many research investigations have shown that individuals with heart failure display elevated levels of inflammatory cytokines in their bloodstream, including IL-1B 65 .The product of the CXCL8 gene (commonly known as IL8) belongs to the CXC chemokine family and serves as a primary mediator of the inflammatory response 66 .Several research investigations have detected the presence of IL-8 at locations of vascular damage, while separate studies have suggested that IL-8 may be involved in different phases of atherosclerosis 66 .In the findings by Rus et al. elevated concentrations of IL-8 were reported within the atherosclerotic wall of human arteries 67 .In another study by Liu et al., it was identified that foam cells originating from human atherosclerotic tissues had elevated concentrations of IL-8 68 .Toll-like receptors (TLRs) are present in a variety of heart cell types, including cardiac myocytes, smooth muscle cells, and endothelial cells 69 .They have been reported to be involved in a variety of functions in diseased conditions such as CVD, apoptosis, allergic diseases etc 70 .Several research investigations have provided evidence that TLR4 activation leads to the upregulation of multiple genes associated with pro-inflammatory cytokines 71,72 , which have significant roles in promoting myocardial inflammation, especially in conditions such as myocarditis 73 , myocardial infarction, ischemia-reperfusion injury 74 , and heart failure 75 .Our result shows the upregulation of these inflammation-related genes except IL6 which is downregulated.The fact that these play a significant role as key regulators emphasizes the critical role of inflammation in CVD is an important finding of our study.
In addition to the identification of inflammation-related genes in the study, we have also uncovered noninflammatory genes that bear significance in the context of CVD such as JUN is found to be a key player implicated in the prevention of stress-induced maladaptive remodelling of the heart 76 .Additionally, MAPK3, a participant in intracellular mitogen-activated protein kinase (MAPK) signaling cascades, is believed to play an important role in the pathogenesis of cardiac and vascular diseases 77 .HSP90AA1, an isoform of heat shock protein 90, has been associated with its role in cardiac remodelling 78 .The presence of PTPRC has been noted in atherosclerotic plaque development 79 , while MYC has exhibited involvement in cardiac myotropy and angiogenesis 80 .FN1 has been linked to coronary artery disease 81 , and STAT3 has been recognized for its cardioprotective functions 82 .These genes, integral to cellular physiology, contribute significantly to diverse aspects of cardiovascular health and disease.Within the PPI network, key regulators (KRs) identified in this study offer valuable insights into the molecular mechanisms underpinning cardiovascular conditions.Consequently, these proteins emerge as promising targets for future research and therapeutic interventions in the development and progression of CVD.It is imperative to underscore the necessity for further experimental studies to validate these findings and elucidate the precise roles of these key regulators in cardiovascular health and disease.
For a better understanding of cell physiology and disease connection of orthologous genes in C. elegans we constructed the PPI network of orthologous genes.In the PPI network, proteins are represented by nodes, and the relationship between proteins is shown by undirected edges.Orthologous gene network analysis results identified the top 10 most important hub genes (aha-1, daf-21, pmk-1, gpb-1, pmk-2, cyb-2.1,cyb-2.2,hif-1, eef-1G and cyb-1) from the PPI network.Ten essential key regulator genes which are as follows: pmk-1, daf-21, gpb-1, crh-1, enpl-1, eef-1G, acdh-8, hif-1, pmk-2 and aha-1 were identified from the PPI network suggesting its crucial role.In diseased conditions such as cancer and various heart and lung diseases, cells and tissue suffer from low oxygen conditions (pathological hypoxia) 83 .Studies have reported that hypoxia activates hif-1 gene in C. elegans 84,85 .Similarly, in case of humans, HIF (ortholog of hif-1) is activated in hypoxic environment.This activation plays a central role in tissue repair, ischemia and cancer 86 .Additionally, results also show that ortholog of important CVD-related genes such as MAPK3, HSP90AA1, HSP90AB1, HSP90B1 (pmk-1, pmk-2, daf-21, enpl-1) playing important role in ortholog gene network.HSP90 has been found to be associated to several CVD.Its client proteins have been found to be involved in cardiac disease pathways such as MAPK signaling, TNF-α signaling, etc. 87 .A study of these proteins and signaling pathways using C. elegans ortholog can provide better insights into the molecular mechanism associated with the disease.
Results of comparative pathway analysis showed that there are 9 commonly enriched pathways that are present between humans and C. elegans.Among the common pathways, two belonged to lipid metabolic pathways "Biosynthesis of unsaturated fatty acid" and "Fatty acid metabolism".Pathways related to lipid metabolism play a very crucial role in CVD as dysregulation in any step can cause disorders such as diabetes, obesity, atherosclerosis, etc.A study by Zhang et al. clearly establishes C. elegans as a promising model for the study of lipid metabolism and lipid metabolic diseases 88 which is consistent with our result as dysregulation in lipid metabolism pathways is a risk factor for CVD.C. elegans and humans also have Autophagy-animal, ErbB signalling pathway, FoxO signalling pathway, MAPK signalling pathway, ABC transporters, Glutathione metabolism and Metabolic pathways that are commonly enriched that might play a role in human CVD development and progression thus making C. elegans as a suitable model organism to study the molecular events and pathways related to CVD.
Autophagy can also play a role in cardiovascular disease through several key signaling pathways including PI3K/Akt/mTOR, IGF/EGF, AMPK/mTOR, MAPKs, p53, Nrf2/p62, Wnt/β-catenin and NF-κB pathways 89 .Recent studies have revealed the mechanism underlying the therapeutic effects of NRG-1/ErbB signaling in the treatment of heart failure.Through activation of upstream signaling molecules such as phosphoinositide 3-kinase, mitogen-activated protein kinase, and focal adhesion kinase, NRG-1/ErbB pathway activation results in increased cMLCK expression and enhanced intracellular calcium cycling.The former is a regulator of the contractile machinery, and the latter triggers cell contraction and relaxation 90 .
FoxOs have unexpected and diverse roles in countering stress, determining cell fate, and regulating energy availability.Because the heart is constantly adapting to different stresses and metabolic conditions, FoxOs appear to play an extremely important role in cardiac physiology 91 .The role of the MAPKs extracellular signal-regulated kinase (ERK), C-jun N-terminal kinase (JNK), and p38 MAPK in cardiac hypertrophy, cardiac remodelling after myocardial infarction, atherosclerosis and vascular restenosis 77 Adenosine triphosphate (ATP)-binding cassette (ABC) transporters may play an important role in the pathogenesis of atherosclerotic vascular diseases due to their involvement in cholesterol homeostasis, blood pressure regulation, endothelial function, vascular inflammation, as well as platelet production and aggregation 92 .A molecule that actively participates in counteracting the oxidizing effect of reactive species is reduced glutathione (GSH), a tripeptide that is present in all tissues, and its synthesis and/or regeneration is very important to be able to respond to the increase in oxidizing agents 93 .
Further, GO functional enrichment analysis also predicted similar functions in both the protein sets of human and C elegans.In each of the GO categories, several common functions were identified.We found 20 common Biological Processes describing the biological purpose of the gene or a gene product for carrying out a molecular function.Similarly, we also discovered 9 common Cellular Component and 13 common Molecular Functions through the comparative study of GO terms.As a result of common pathways and GO terms present between humans and C. elegans that are related to CVD All these results indicate that C. elegans can be a potential animal model to study various pathways and genes that are associated with CVD.Consequently, the CVD-related genes present in C. elegans should be considered as a starting point, and further experimental investigations are required to fully explore C. elegans as a model organism for CVD.

Conclusion
In summary, we explored the applicability of C. elegans as an animal model to study CVD.We identified 1113 orthologous genes in C. elegans that are related to CVD in humans.We also explored the pathways and GO terms related to CVD that are common between humans and C. elegans.This study provides the first genomic view on CVD-related genes and pathways that are present in C. elegans, supporting its use as a prominent animal model for the study of CVD.

Figure 1 .
Figure 1.Schematic representation of the methodology of study showing the cardiovascular disease proteins were compared and analyzed.

Figure 2 .
Figure 2. Identification of key regulators using three attributes of CVD PPI Network.(A-C) Topological properties of the top 20 DEGs in terms of centrality measurements of degree, betweenness, and closeness.(D) Intersections among top ten genes having the highest centrality values of closeness, betweenness and degree.11 common genes among the top 20 genes of each of topological parameter.(E) Bar plot showing the fold change for expression of 11 key regulators in individual healthy control and CVD patients.The red bar shows log2 foldchange of upregulated genes and green bars shows log2 foldchange of downregulated genes.

Figure 4 .
Figure 4. C. elegans orthologous gene PPI regulatory network of the top ten hub genes of CVD.Regulatory network of the top ten (A) Degree genes (B) Betweenness centrality genes (C) Closeness centrality genes (D) Bottleneck genes.(E) Intersections among top 20 genes having the highest degree and centrality values of closeness, betweenness.10 common genes among the top twenty genes of each of topological parameter.

Figure 5 . 9 Figure 6 .
Figure 5. Top 4 modules of C. elegans orthologous gene PPI Network: (A) Module 1 has 24 nodes and 276 edges with a score of 24.000; (B) Module 2 has 49 nodes and 361 edges with a score of 15.042; (C) Module 3 has 17 nodes and 114 edges with a score of 14.250 and (D) Module 4 has 8 nodes and 25 edges with a score of 7.143.

Figure 7 .
Figure 7. Functional enrichment analysis of C. elegans orthologous genes.(A) The significant enriched biological process of targeted C. elegans orthologous genes.(B) The significantly enriched cellular component, (C) The significant enriched molecular function and (D) KEGG Pathway.Dot size indicate count.The count represents the number of genes associated with each process.The dot colour denotes the p values of process, and the x-axis represents the fold enrichment score.The red colour text represents the similar gene enrichment and Pathways in humans and C. elegans.

Table 1 .
Basic information on GEO Datasets used in the study.

Table 2 .
Top 10 highest Degree (Hub genes), Betweenness and Closeness centrality genes of the CVD PPI Network.

Table 3 .
Top 10 highest Degree (Hub genes), Betweenness and Closeness centrality genes of the C. elegans orthologous genes network. S.

no. Name Degree Name Betweenness centrality Name Closeness centrality
Vol:.(1234567890) Scientific Reports | (2024) 14:7471 | https://doi.org/10.1038/s41598-024-56562-9www.nature.com/scientificreports/ is orthologous to human gene STAT3 is also among the top 10 topological properties of C. elegans orthologous PPI Network.All these results indicate that identified orthologous genes may play important role in elucidating the mechanisms of cardiovascular disease, which are currently poorly understood and C. elegans may serve as a good animal model.